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Abstract 



Approximate random fc-colouring of a graph G = (y, E) is a very well studied problem in computer 
science and statistical physics. It amounts to constructing a fc-colouring of G which is distributed 
l/-v \ close to Gibbs distribution, i.e. the uniform distribution over all the fc-colourings of G. Here, we deal 

with the problem when the underlying graph is an instance of Erdos-Renyi random graph G{n,p), 
where p = d/n and d is fixed. 

We propose a novel efficient algorithm for approximate random fc-colouring with the following 
properties: given an instance of G{n,d/n) and for any fc > (2 + e)d, it returns a fc-colouring 
distributed within total variation distance n^^^^'' from the Gibbs distribution, with probability 1 — 



n-^^(i). 



What we propose is neither a MCMC algorithm nor some algorithm inspired by the message 
passing heuristics that were introduced by statistical physicists. Our algorithm is of combinatorial 
nature. It is based on a rather simple recursion which reduces the random fc-colouring of G{n, d/n) 
to random fc-colouring simpler subgraphs first. 
C^^ i The lower bound on the number of colours for our algorithm to run in polynomial time is dra- 

^P I matically smaller than the corresponding bounds we have for any previous algorithm. 

l> 

f^ • Key words: Random colouring, sparse random graph, efficient algorithm. 
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1 Introduction 

Approximate random fc-colouring of a graph G = {V, E) is a very well studied problem in computer 
science and statistical physics. It amounts to constructing a fe-colouring of G which is distributed close 
to Gibbs distribution, i.e. the uniform distribution over all the fc-colourings of G. Here, we deal with the 
specific algorithmic problem when the underlying graph is an instance of Erdos-Renyi random graph 
G{n,p), where p = d/n and d is fixed. 

The most powerful and most popular algorithms for this kind of problems aie based on the Markov 
Chain Monte Carlo (MCMC) method. There the main technical challenge is to establish that the under- 
lying Markov chain mixes in polynomial time (see ||6l). The work in HI (which improved Q) shows 
that the well known Markov chain Glauber dynamics for A;-colourings has polynomial time mixing for 
typical instances of G{n, d/n) as long as the number of colours k is larger than k{d), a number which 
depends only on the expected degree of G(n, d/n), d. 

Notably, both HI |3l overcame the "maximum degree obstacle" from which most techniques for 
analysing the mixing time of the Glauber dynamics suffer, i.e. they are stated in terms of the maximum 
degree of the graph. This makes them insufficient for G{n, d/n) where the maximum degree grows as 
0((logn)/(loglogn)) with probability 1 — o(l). 

Recently physicists proposed some heuristics for computing deterministically marginals of Gibbs 
distribution. These heuristics are based on the message passing algorithm Belief Propagation (see \T}) 
and ideally they could be used for random colouring G{n, d/n). There, the main challenge is to show 
that the computation of the marginals is accurate. In turn this amounts to establishing certain spatial 
correlation decay conditions for the Gibbs distribution. We should remark that the heuristics proposed 
by statistical physicists, namely Belief Propagation guided decimation and Survey Propagation guided 
decimation, were put forward on the basis of very insightful but highly non-rigorous statistical mecha- 
nics considerations (see ||2l). The work in H presents an efficient algorithm which is a variation of 
Belief propagation and returns an approximate random A;-colouring of a typical G(n, d/n) as long as 
k > k'{d), where k'{d) is a number which depends only the expected degree (i.e. k'{d) = d^'^). 

In this work we propose a novel algorithm for approximate random /c-colouring G{n, d/n) which 
not only overcomes the "maximum degree obstacle" but somehow optimizes the dependence of the 
minimum number of colours from the expected degree d. The lower bound on the number of colours 
for our algorithm to run in polynomial time is dramatically smaller than the corresponding bounds we 
have for any previous algorithm on the problem. The algorithm does not fall into any of the previous 
two categories, i.e., it is neither MCMC nor based on Gibbs marginals computation. 

We are based on the following humble observation: Let {v, u} be an edge of Gn^d/n- A random 
colouring of Gn^d/n can be seen as a random colouring of G„^/„\{u, u} with the additional property 
that V and u are assigned different colours. Assume that we have a polynomial time algorithm, call it 
STEP, such that given any graph G and two non-adjacent vertices v, u it transforms a random colouring 
of G to a random colouring which has the extra property that v and u take different colours. In that 
case, the initial problem can be reduced to taking a random fc-colouring of G„^/„\{t), u} and then use 
STEP. The reasonable question, then, would be how can someone colour Gn^d/n\{v, u} randomly. We 
can set up a recursion by applying the previous reduction for G„^/„\{i;, u} and so on. Note that as the 
recursion proceeds the structure of the graph that is considered gets simpler and simpler. This is due to 
the edge deletions. Clearly, after a certain number of recursive calls the graph becomes so simple that it 
can be /c-coloured randomly in polynomial time by some known algorithm. 

A great deal of this work illustrates the implementation of STEP in the special case where the input 
graph G is a typical instance of Gn^d/n or any of its subgraphs that are considered in the recursion. STEP 
will be an approximation algorithm, i.e. given a random colouring of G in the input the distribution of 
the output will be an approximation of the desired one. Consequently, at the end we get an approximate 
random /c— colouring of G(n, d/n). 



We use total vaiiation distance as a measure of distance between distributions. 
Definition 1 For the distributions Ua, fh on [k]^ , let \\va — VhW denote their total variation distance, i.e. 

\\T^a-Vb\\= max \Va{9.') - Vb{^')\. 

For A C y /e? ||fa — i^^IIa denote the total variation distance between the projections of Va cmd v^, on 

[k]\ 

STEP will have the following general property: Consider in the input a random A;-colouring of some 
graph G and v, u, two non-adjacent vertices of G. The accuracy of the outcome depends on certain 
spatial mixing properties of the Gibbs distribution of the colourings of G. In particular, for a random 
/c-colouring of G it suffices that there is a sufficiently large 6 > such that 



Prlu is coloured ch is coloured 



< exp {—b • dist{v,u)) yc,qG[k]. 



(1) 



Moreover, assuming that ([T|l holds, then the distribution of the output of STEP is within total variation 
distance from the ideal distribution a quantity which is proportional to the r.h.s. of ([T])- Consequently, 
when we consider the previous recursive random colouring algorithm (that uses STEP), we note that it 
is desirable to delete edges that belong to long cycles in each recursive call. 

We show that for a typical G{n, d/n) and for A; > (2 + e)d, where e > is fixed, we get a relation 
as in ([T]) for the random /c-colourings of any graph in the recurrence. Moreover, if we are careful enough 
on how do we delete the edges in the recuiTcnce, the outcome of the random colouring algorithm is very 
close to Gibbs distribution. In particular, we show the following theorem. 

Theorem 1 Let fi be the uniform distribution over the k-colourings ofGn.d/n ^'^^ '^^ /"' be the distribu- 
tion of the colouring that is returned by our random colouring algorithm. Taking k> {2 + e)d, for fixed 
e > 0, then with probability at least 1 — n so log d /j holds that 



0(n 9oi°g 



\fi- fi 



for any fixed d > dQ{e). 

Additionally, we provide guarantees on the time complexity of the algorithm. 

Theorem 2 With probability at least l—n^"^'^, it holds that the time complexity of the random colouring 
algorithm is 0{v?). 

Detailed proofs of Theorem [T] and Theorem |2] appear in the appendix. Section lAl 

Notation We denote with small letters of the greek alphabet the colourings of a graph G, e.g. o", -q, r, 
while we use capital letters for the random variables which take values over the colourings e.g. X, Y, Z. 
We denote with a^ the colour assignment of the vertex v under the colouring a. Similarly, the random 
variable X{v) is is equal to the colour assignment that X specifies for the vertex v. Finally, for an integer 

A; > let [k] = {1, . . . , k]. 



2 Basic Description 

In this section we provide a more detailed description of our approximate colouring algorithm. We as- 
sume that the input graph is an instance of G{n, d/n) and k is the numbers of colours. 

Set up. Consider a sequence of graphs Gq, . . . ,Gr such that every Gi is a subgraph of Gn^d/n- Moreover, 
Gr is identical to Gn^d/n^ while Gi is derived by deleting some edge of Gj+i. 

So as to get the graph Gi from Gj+i the only rule we follow is that we delete, arbitrarily, an edge 
that belongs to a sufficiently large cycle, i.e of length at least (log77,)/(91ogd). Go is the graph that 
comes up when there no are other such edges to delete. Note only that Gi, as a subgraph of G{n, d/n), 
is somehow random. 

Colouring. With probability 1 — n~^(^\ the sequence of subgraphs has the property that Gq is simple 
enough and we can A;-colour it randomly in polynomial time by using some known algorithm. In that 
case the algorithm takes a random colouring of Gq. Then, for i = to r — 1 it does the following: it 
takes the random colouring of Gj, it does a simple, i.e. polynomial time, processing of this colouring 
and gets a random colouring of Gj+i. The algorithm continues until Gr- 

Let G and G' be two consecutive terms in the sequence of graphs, above. Assume that G is derived 
by deleting the edge {v, u} from G' . The critical question is the following one: Given X, a random k- 
colouring of G, how can someone use it to get efficiently X' , a random /c-colouring of G'. A moment's 
reflection makes it clear that if X has the additional property that X{v) ^ X{u), then X is distributed 
u.a.r. among the /c-colourings of G' . In this case we can simply set X' = X. Unfortunately, this cannot 
always be the case and the random colouring algorithm we propose somehow deals with situations as 
the one where X{v) = X{u). 

Definition 2 (Good & Bad colourings) Let a be a proper k-colouring ofG. We call a a bad colouring 
ofG ifcy = (Tu- Otherwise, we call a a good colouring ofG. 

It turns out that the basic algorithmic challenge here is captured in the following problem. 

Problem 1 Given a bad random colouring ofG, turn it to a good random colouring, in polynomial time. 

Let us give an intuitive description of our algorithm for the above problem. First remark the following: 
Consider a, some /c-colouring of G, and some q G [k] such that a^ / q. It is easy to see that a specifies 
a connected subgraph of G which includes v while every vertex in this subgraph is assigned colouring 
either q or a^. The maximal induced subgraph of this kind is called "disagreement graph'Lj. Figure [T] 
shows a 3-colouring. The fat lines indicate the disagreement graph specified by using the colour "g". 

It is direct to show that the disagreement graph that is specified by the colouring a and the colour q 
is always a connected, bipartite graph whose parts ai^e coloured a^, and q, respectively. 

Definition 3 Assume that a, a k-colouring of G, and q € [A;]\{cr.u} define the disagreement graph Q. 
The k-colouring of G, a' is called "q-switching of a" if it is derived from a by switching the colour 
assignments of the vertices of G that correspond to the two parts of Q. 

In Figure |2] we present the "r^-switching" of the colouring in Figure [T] It is direct that for the colouring 
a and for some q € [A;]\{(T^,} there is a unique (/-switching of a. Also, it straightforward to show that 
the g-switching of any proper /c-colouring of G is a proper colouring, as well q 

Generally the g-switching of a bad colouring is not always a good. However, given some technical 
conditions which hold with probability 1 — ?i~^(^) over the choices of G, we show the following, non- 
trivial, statement 



'For a more formal definition of "disagreement graph" see in Section lSTI 
^ E.g. see proof of Lemma[2] 





Figure 1: "Disagreement graph". 



Figure 2: "g-switching". 



The distribution of the g-switching of Z, a bad random fc-colouring of G, is very close to the 
distribution of the good random fc-colourings of G, when the colour q is chosen uniformly at 
random from [A;]\{Z(i')} and k is sufficiently large. 

The above fact suggests that we can have the the following approximation algorithm for Problem[T]when 
G is a "typical" instance: Let X be a random colouring G. If X is good, then set X' = X. If X is a 
bad, then choose at random some q € [k]\{X{v)} and set X' to be equal to the g-switching of X. 

Remark. The algorithm in the previous paragraph is exactly the one we refer in the introduction as 

STEP. 

Returning to the approximate random colouring algorithm, we can build upon STEP as follows. First, 
colour randomly Go with some known algorithm. Then, for i = to r — 1 do the following: If the 
colouring of G, is good, then consider it as the colouring for Gj+i. Otherwise, choose appropriately a 
random colour q and set as a colouring for the graph Gj+i the g-switching of the colouring of Gj. 

The above is a concise description of our approximate random colouring algorithm. Clearly it is 
efficient and accurate only for typical instances of the input graph G{n, d/n), i.e. it has the properties 
described by Theorem [T] and Theorem [2l 



2.1 Some further remarks 

To get a better intuition about the algorithm STEP we focus on a case where things go wrong, i.e. 
consider the following. Let a be bad colouring of G, i.e. a^, = cju- It is possible that the disagreement 
graph specified by a and some colour g to be so large that it contains both v and u. In this case the 
g-switching of o" is a bad colouring. Clearly in this case STEP fails to generate a good colouring of G. 
Moreover, it is possible to have good colourings of G that cannot be generated by applying the algorithm 
STEP to any bad colouring of G. Such colourings constitute pathological cases for the algorithm. These 
pathological cases do not cause big problem as long as they occur rarely, i.e. the fraction of colourings 
of G that causes such situation is sufficiently small. The occurrences of pathological cases are rare when 
k is large and v, u are far apart. 



3 Problem [T] and a -isomorphism 

STEP uses the idea of g-switching so as to achieve a certain kind of mapping between bad and good 
colourings. Ideally this mapping should have the property that, for a bad random colouring of G on 
the input, the image should be a good random colouring of G. Unfortunately the g-switchings (as 
implemented by STEP) do not have this property but somehow they approximate such mapping. We 
introduce few notions which capture the essence of these ideas. For the the following definitions in this 



section consider a fixed grapii G and let Q. be the sets of its proper fc-colourings |5 

Definition 4 (Isomorphism) We let ili, (^2 Q 0. We say that ili is isomorpiiic to O2 if and only if there 
is a bijection T : fti ^ ^2- 

Tlie basic property of isomorpiiism we need here is contained in the following corollary. 

Corollary 1 Assume that we have two isomorphic sets ili and Q2 cmd let T be a bijection between these 
two sets. Then, given Xi, a random member of ^i, the distribution ofT{Xi) is the uniform over ^2- 

Theproof of Corollary [J appeai^s in Section ICTTl The previous definition of isomorphism is standai^d and 
generally it expresses a notion of "similarity". We will need to get a bit further from this, i.e. we intro- 
duce a more general notion of "similarity" between sets of colourings which we call a-isomorphism. 

Definition 5 (a-isomorphism) We let ili, ^2 ^ il and a E [0, 1]. We say that ili is a-isomorphic to 
Q.2 if there are sets Vl'^ C rii and ilg ^ ^2 such that 

• \n[\ > {1 - a)\ni\, for i = 1,2. 

• il.'^ and ilg ^''^ isomorphic. 

We call {Q,[, ilg) as the isomorphic pair of Q,i and 0,2- 

Thus, rather than asking for the whole sets ili and U2 to be isomorphic, a-isomorphicity requires only 
sufficiently large subsets from each of ili and 0,2 to be isomorphic. The notion of a-function, that 
follows, is for a-isomorphism the analogous of the bijection for isomorphism. 

Definition 6 (a-function) For some a G [0, 1], let ili, ^2 ^ ^b^ two sets such that Q,i is a-isomorphic 
to 0,2 with isomorphic pair (ri'^, f^g). Let h : 0.^ ^ 0,2 be a bijection. Then the function iJ : ili — )• [k]^ 
is called a-function if and only if Vo" € 0,'^ it holds that H{a) = h{a). 

Note that we can be a bit loose on the definition of an a-function when the input a does not belong 
to Q,[, i.e. we allow the a-function to take any value in [k]^ . Showing that two sets ili and O2 are 
a-isomorphic reduces to providing a function which has the properties stated in Definition [6l 

Typically, we are given two sets of /c-colourings of G, e.g. Qi and Q2, and we will be asked to devise 
an a-function which then suggests that these Oi and ^^2 are a-isomorphic. The challenge is to devise an 
a-function H which complies to the following efficiency rules: First, given some cr G ili we want H{cj) 
to have as few different colour assignments from a as possible, while the vertices with the different 
colour assignments should be as close to each othere as possible. Second, the smaller a and k are, the 
better. The g-switchings we introduced in the previous section are examples of a-functions between 
certain sets of colourings of G. The next lemma states the most important property of a-isomorphism 
and somehow it generalizes Corollary [T] 

Lemma 1 Assume that the set Q,i is a-isomorphic to Q.2, <^nd H : Q,i ^ [ky is an a-function. Let 
z be a random variable distributed uniformly over Qi and let z' = H(z). Denote by v the uniform 
distribution over ^2 cind v' the distribution of z'. It holds that 

Wv — v'W < a. 

The proof of Lemma [T] appears in the appendix. Section ICT] 



''Take k sufficiently large that Q. is non-empty. 



3.1 Dealing with Problem [T] 

In this section we focus on STEP. For clarity reasons we describe the algorithm by assuming that the 
graph G in Problem [T]is some general fixed graph. A basic part of the presentation involves relating the 
accuracy of STEP to a-isomorphism between certain sets of A;-colourings of G. 

Let us introduce some notation. Let $7 denote the set of /c-colourings of G and for c,qG [k] we 
let Q{c, (?) C J7 denote all the fc-colourings of G that assign v and u the colours c, q, respectively. We 
define formally a disagreement graph as follows: 

Definition 7 (Disagreement grapii) For a & ^ and some q G [A;]\{cr^} we let the disagreement graph 
QcTv,q — (^'i ^') ^^ ^'^^ maximal induced subgraph ofG such that 



V' = IxGV 



3 path wq, . . . ,wt, in G such that: 
Wo =v,wt= X, a{wj) G {av,q} 

It is important to remember that the disagreement graph is always connected, bipartite and maximal, 
i.e. for every a and q, G has no vertex y ^ Qa^.q which has a neighbour in V' and at the same time 
(Ty G {(Tt,, q}. Furthermore, we define formally the g-switchings as a function H : Q. x [k] ^ [^]^> i-C- 
H{a, q) returns the q-switching of a. 

Function H 

Input: X eVLwdqe [k\\{X{v)} 

S&ic = X{v). 

Set Vi = {we Qx{v),q\X{w) = c}. 

Set V2 = {w(^ Qx[v),q)\X{w) = q}. 

\/w G Vi set X{w) = q. 

Vw G V2 set X{w) = c. 
Output: X 

We have reached to the point where we have all the definitions we need to describe the algorithm STEP. 
STEP 

Input: X G ri, and k 

If X is a good colouring of G, then set Y = X. 

If X is a bad colouring of G, then choose q u.a.r. from [fc]\{X(i;)} and set Y = H{X, q). 
Output: Y 

As far as the accuracy of STEP is concerned we have to show that if X is a bad random A;-colouring of G, 
then H{X, q), as calculated by the algorithm, is distributed sufficiently close to the desired distribution 
To this end o-isomorphism comes into use. 

For any c,q£ [k] we let S{c,c) C Q(c,c) and S{q,c) C 0(g, c) be defined as follows: The set 
S{c, c) contains every a G Q{c, c) with the property that the disagreement graph Qay,q does not contain 
the vertex u. Similarly, S{q, c) contains every a G r2(g, c) such that the disagreement graph Qa^^c does 
not contain the vertex u. We show that these two sets are isomorphic. 

Lemma 2 For any c,q(z [k] with c ^ q, it holds that S{c, c) and S{q, c) are isomorphic and the 
function H{-,q) : S{c,c) — )■ S{q,c) is a bijection. 

The proof of Lemma [2] appears in the Section IC2l Based on the previous consideration, we provide a 
general relation between a-isomorphism and the accuracy of STEP. We make the following assumption. 



Assumption 1 For some a G [0,1] it holds that ^{c,c) is a-isomorphic to Q{q,c) with a-function 
H{-,q), for any c,q£ [k] such that c ^ q. 

Clearly, {S{c,c), S{q,c)) is the isomorphic pair of the a-isomorphism between Q,{c,c) and Q{q,c). 
Assumption [Uimposes an upper bound for the number of pathological colourings □ of G. It implies that, 
for any c, g G [k] all but an a fraction of the colourings in ^{q, c) do not have disagreement graph Qg c 
which includes both v and u. The same should hold for Q{c, c) for disagreement graph Qc,q- 

Theorem 3 Let v he the uniform distribution over the good k-colourings of G. Let, also, v' be the di- 
stribution of the output o/STEP when the input colouring is distributed uniformly over the k-colourings 
of G. Under Assumption\l\it holds that 

where a is defined in Assumption\l\ 

The proof of Theorem [3] appears in the appendix, Section IC.3I The impact of Assumption [T] to the 
accuracy of STEP is apparent. 

The value of a in Assumption [T]depends on G, k and the function H. The natural way of considering 
that r2(c, c) is a-isomorphic to Q.{c, q) is mainly as a consequence of the a-function H{-, q). That is, 
the two sets have this property because we have devised a mapping, the H{-, q), which happens to be 
an a-function between the two sets. Consequently, someone could device a "better" function, i.e. an 
a'-function for ^{c, c) and Q.{c, q) such that either a' < a, or a' = a but allowing smaller k, or both. 

Since the algorithm STEP implements the a-function, the performance of the a-function reflects the 
performance of the algorithm itself. Clearly, the a-function should be computable in polynomial time. 

Lemma 3 For a graph G = [V^E) and some integer k, the time complexity of computing the function 

H{;q)isO{\E\). 

The proof of Lemma [3] appears in Section IC4l 

4 From the algorithm Step to Random Colouring. 

Here, we give a general presentation of the approximate random colouring algorithm, which builds 
upon STEP. We also study properties of the algorithm like time complexity and accuracy. In particular, 
we study the accuracy of the algorithm under general assumptions about a-isomorphism, as we did in 
Section [XT] for STEP. As in the previous cases, the input graph G is considered to be fixed. 

First, we extend the notation of the previous section to fit here. For input graph G the algorithm 
considers the sequence of subgraphs Go, Gi, . . . , G^. Gj is derived by deleting from Gj+i an edge 
which we call {vi^Ui}. Let ilj be the set of fc-colourings of Gj. For any c,q & [k] we let r2j(c, q) be the 
set of colourings of Gj which assign the colours c and q to the vertices Vi and Uj, respectively. 

We proceed by describing the full algorithm in pseudocode. The variable Yi, below, denotes the 
fc-colouring that the algorithm assigns to the graph Gj. 

Random Colouring Algorithm 

Input: G, k. 

Compute Go, Gi . . . , Gr-. 

Compute Yq. /* Get a random k-colouring ofGo-*/ 

For 0<i<r — Ido 

Set Yj+i the output of STEP with input Yi. 



see also discussion in SectionlZT 



Output: Yr. 

In the second line the algorithm computes the sequence of subgraphs and in the third it colours randomly 
Gq. a detailed description of how can someone construct the sequence of subgraphs and colour ran- 
domly Go is a graph specific problem. For the case where the input graph is an instance of G{7i, d/n), 
we give a detailed treatment of in the proof of Theorem [T] and Theorem [2l^[ However, using Lemma |3] it 
is direct to get the following theorem. 

Theorem 4 Under the condition that Gq can be k-coloured randomly in polynomial time, the random 
colouring algorithm has polynomial time complexity. 

The next issue we have to investigate is the accuracy of the algorithm. As in Section [XT] we relate the 
accuracy of the random /c-colouring algorithm with a-isomorphism by using the following assumption. 

Assumption 2 For i = 0, . . . , r — 1 and some a S [0, 1] it hold that fli{c, c) is a-isomorphic to Oj(g, c) 
and H{-,q) is a a-function, for any c,q & [k] such that c ^ q. 

The a-function H is the same as the one defined in Section [3?T] Let (5j(c, c), Si{q, c)) be the isomorphic 
pair of the a-isomorphism between rij(c, c) and Oj(g, c). The sets 5j(c, c) and Si{q, c) ai^e defined in the 
same manner as S{c, c) and S{q, c), in Section [3711 From Assumption[2l we get the following theorem. 

Theorem 5 Let fi be the uniform distribution over the k-colourings of the input graph G. Let, also, 
fi' be the distribution of the colourings that is returned by the random colouring algorithm. Under 
Assumption [2] it holds that 

where r is the maximum index in the sequence Go,Gi, . . . , Gr. 
The proof of Theorem [5] appears in the appendix, Section [C31 

5 Proof sketch for Theorem [1] 

Due to space limitations, in the remaining pages we give a proof sketch our main result. Theorem [T] 
That is, we consider the random colouring algorithm with input an instance of G{n, d/n) and we let k 
be the number of colours. From a technical perspective there are two issues to deal with. The first is 
how do we construct the sequence of subgraphs. The second is to replace the rather general Assumption 
[2] about a-isomorphism between colour sets with specific results for the graphs Gq, Gi, . . . , G,.. 

When the algorithm constructs the sequence of subgraphs it should take into consideration the pre- 
vious remark that it is preferable in the graph Gj the vertices vi and Ui to be at a sufficiently large 
distance. To see why we need this property we provide the following corollary, which follows directly 
from previous definitions. 

Corollary 2 Consider some fixed graph Gi and c,q£ [k]. The set rij(c, c) is a-isomorphic to 0,i(q, c) 
with a-function H{-,q) if and only if the following holds: Choose u.a.r a colouring from ^i{c,c) and 
let Qc.q be the disagreement graph specified by this colouring and q. It should hold that 

a> Pr[vi,UieQc,q\Gi]. (2) 

Additionally, the analogous condition should hold for a random colouring ofQi{q, c). 



'See Section|Al 



Since we are interested in the minimum possible value for a, we try to minimize the probability term 
in ^. Clearly, the greater the distance between Vi and n, the less probable is for Qc^q to include them 
both and, consequently, the more accurate the random colouring algorithm gets. To this end we use the 
following lemma to construct the sequence of subgraphs. 

Lemma 4 With probability at least 1 — n~'^'^ we can have the sequence Go, . . .Gr satisfying the fol- 
lowing two properties. 

1. Gq consists only of isolated vertices and simple cycles, each of maximum length less than °^" 



2. In Gi, the graph distance between Vi and Ui is at least q°^" 



igd- 

Additionally it holds that 

Pr[r > (1 + n~^/^)dn/2] < exp(-n^/^). 

In the rest of the analysis of the algorithm we assume that the sequence of subgraphs is such that the 
distance between Vi and Ui is at least 7logn, where 7 = (91ogd)~^, for i = 0, ... ,r — 1. Since 
Go, . . .Gr are subgraphs of Gn^d/n^ somehow they are random too. The reader should feel free to 
assume any arbitrary rule that generates G-i from each instance of G(?i, d/n). The only restriction we 
have is that of the distance between Vi and Uj. Then, we use the following theorem. 

Theorem 6 Take k > (2 + e)d, where e > and d is a sufficiently large fixed number. There is /3j such 
that for any a > f3i it holds that Qi{c, c) is a-isomorphic to i7j(c, q) and H(-, q) is an a-function while 



E[Pi] <C -n V^+45iogd 



for any c,q£ [k], C > is fixed and i = 0, . . . ,r — 1. 

The expectation of the quantity /3j is over the graph instances G{n, d/n). Taking ro = {l + n~^/^)dn/2, 
Theorem [5] implies that 



E[\\fi-m<E 



.j=0 



< '^E[l3i\r < ro] +n^Pr[r > ro]. 



i=0 



In the last inequality we use that /3i G [0, 1]. It easy to see that E[/3i\r < ro] < Pr ^[r < ro] ■ E[l3i]. 
Then, Theorem|6]and Lemma|4]suggest that there is fixed C > such that £■ [| |// — /x| |] < C ■ n~45iogd. 
The theorem follows by applying the Markov inequality. 

5.1 Proof sketch for Theorem |6] 

Consider some fixed instance of Gi and let c,qG [k] such that c ^ q. Choose u.a.r. a colouring 
from ilj(c, c) and let Qcg be the disagreement graph that is specified by the colouring we chose and q. 
Similarly, choose u.a.r. from Qi{q, c) and let (5<j,c be the disagreement graph specified by the chosen 
colouring and c. According to Corollary |2j Qi{c, c) is a-isomoprhic to ^{q, c) for any a > Pi such that 

/3j = Pr[vi, Ui G Qc,q\Gi] + Pr[vi,Ui G Qq^c\Gi]. Taking the average over Gi we have 

E[(3i] = Pr[vi, Ui G Qc,q] + Pr[vi, Ui G Qg,c]. (3) 

We provide a bound on the probability terms in ^ by using the following proposition. 

Proposition 1 Take k > (2 + e)d, for fixed e > 0. Let a be a k-colouring of Gi that is chosen u.a.r 
among U^i^\]Ali{c, d). For some q G [A;]\{c} we let the event Ai ="vi and Ui G Qa^ q "■ There is a 
positive constant G such that 



Pr[Ai] <G-n V"^45iogd; ^^g 



Note that in the above proposition we take a disagreement graph of a colouring chosen u.a.r. among the 
colourings of an instance of Gi that assign the vertex Vi the colour c (the colouring of ui is "free"). We 
show that choosing a u.a.r. a colouring from \J^i^\i.]p.i{c, c') the probability p for this colouring to be in 
rjj(c, c) is constant, i.e. p = 0(1). Then, the law of total probability suggests that Pr[vi,Ui G Qc,q\ < 
p ■ Pr[Ai]. We work similarly for Pr[vi, Ui G Qq,c\- The theorem follows. 

5.2 Proof sketch for Proposition [1] 

In the experiment in the statement of Proposition [T] we let Wi{l) denote the number of paths in Qcg that 
start at vi and end at ui and have length I. By the Markov inequality we get that 

oo 

Pr[Ai\< Y, E[Wi{l)\, (4) 

i=7logn 

where 7 = (9 log d)^^. Thus it remains to bound the expectation on the r.h.s. 

For a vertex w, we let degi{w) be its degree in the graph Gj. Consider the product measure V{Gi, k) 
such that each vertex ti; G Gj is disagreeing with probability q^ = k^dlqiw] ^'^'^ non-disagreeing with 
probability 1 — q^. Also, the vertex vi is disagreeing with probability 1. When k < degi{w) we set 
q^ = 1. A path of disagreement in Gi is any simple path which has all its vertices disagreeing. 

Let rj(/) denote the number of paths of disagreement between Vi and Ui in Gi, in a configuration 
chosen according to V{Gi, k). Through a stochastic order relation we show that for any I it holds 

E[Wi{l)]<Ev[Ti{l)], (5) 

where the rightmost expectation is w.r.t. both the measure V{Gi, k) and Gi. Then taking /c > (2 + e)d 
and sufficiently large d it holds that 

Erm)\ < 9(1) . n'- (^)' . (iTT^)' = e(l)i(l +^/S)-'- ») 

The coefficient n^~^ comes from the fact that between Vi and Vi there are at most n^~^ paths of length I, 
{d/nY is an upper bound for the probability to have a specific path of length I in Gi and the final coef- 
ficient is related to the probability for a path of length / to be a "path of disagreement". The proposition 
follows by combining (lU),© and ^. 

To get a better picture of why there are not many paths of disagreement when k > {2 + e)d consider 
qyj the marginal distribution of w in Gi to be disagreeing. For fc > (2 + e)(i it holds that 

^" - {l+\/2)d + P'^^'^^Siiw) > (1 + e/2)d + 1]. 

Clearly, degi{w) is dominated by B{n, d/n). Using Chemoff bounds Q we can show that for any fixed e, 
the rightmost probability is smaller than e'^ '^, for fixed c'. Then, roughly speaking, we have the following 
situation: The expected degree of w is at most d. Also, w is disagreeing with probability q^^, < 1/d, for 
sufficiently lai^ge d. Consequently, for every path of disagreement that enters w the expected number of 
paths that leave w are d ■ q^ < 1. 



*Corollary2.4in(5]. 
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Appendix 

A Proof of Theorem [1] and Theorem |2] 

In this section we use results from Section |4] to show Theorem [T] and Theorem |2] when the input of the 
random colouring algorithm is an instance of G{n, d/n). Essentially there are two issues to deal with, 
the first is how do we construct the sequence of subgraphs, while the second is to replace the, rather 
general. Assumptions |2] with more specific results for the colourings of the graphs Go, Gi, . . . , G,-. 

It is easy to construct a sequence of subgraph so as to have Go randomly fc-coloured in polynomial 
time (e.g. take it such that Gq is empty). However, the actual construction of the sequence of subgraphs 
is a bit more complicated task. It has been remarked very early in this work that in the graph Gj the 
vertices Vi and Uj are at a sufficiently large distance. To see why we need this property we provide the 
following corollary, which follows directly from the definitions in the previous sections. 

Corollary 3 Consider some fixed graph Gi and c,q £ [k]. The set rij(c, c) is a-isomorphic to ^i{q, c) 
with a-function H(-,q) if and only if the following holds: Choose u.a.r a colouring from Vli[c^c) and 
let Qc,q be the disagreement graph specified by this colouring and q. It should hold that 

a> max Pr[vi,Ui £ Qc,q\Gi]. (7) 

qelk]\{c} 

Additionally, the analogous condition should hold for a random colouring of^i{q, c). 

Since we are interested in the minimum possible value for a, we see that the greater the distance between 
Vi and Ui the less probable is for the disagreement graph to include them both. Thus, the greater the 
distance between Vi and Ui the more accurate the random colouring algorithm gets. To this end we use 
the following lemma to construct the sequence of subgraphs of Gn^d/n- 

Lemma 5 With probability at least 1 — n~'^'^ we can have the sequence Gq, . . .Gr satisfying the fol- 
lowing two properties. 

1. Go consists only of isolated vertices and simple cycles, each of maximum length less than q°^"^ . 

2. In Gi, the graph distance between Vi and Ui is at least q°^"^ . 

Additionally it holds that 

Pr[r > (1 + n-^'^)dn/2] < exp(-?i^/'^). 

The proof of Lemma |5] appears in Section IC6l 

In the analysis that follows, we assume that the sequence of subgraphs that is computed by the 
random colouring algorithm, has the properties stated in Lemma |5] Since Go, ... G^ are subgraphs of 
Gn,d/n^ somehow they are random too and they depend on d. The reader should feel free to assume any, 
arbitrary, rule that generates Gj from each instance of G{n, d/n). The only restriction we have is that 
of the distance between Vi and Ui. 

Theorem 7 Take k > {2 + e)d, where e > and d is a sufficiently large fixed number. There is /3j such 
that for any a> (5 it holds that ^i{c, c) is a-isomorphic to Vti{c, q) and H{-, q) is an a-function while 

E[ft| < (i^„-('+-i 

for any c,q£ [k] and i = 0, . . . ,r. 

A.1 



Since the graphs Gi are random the coiTesponding sets Qi are random too. The above expectation is 
taken w.r.t. the random graph Gj, for i = 0, . . . , r — 1. The proof of Theorem |7] appears in Section IBJ 

Proof of Theorem [1} Using Theorem |5] and Theorem |7] we have that 



E[\\i,-m<E 



.4 = . 



where the expectation is taken over the instances of the input Gn^d/n- Noting that /3j € [0, 1], we get 

(l+n-i/=')dn/2 

E{\\^i-fi\\]< ^ E[Pi\r<{l + n-^'^)dn/2\+ri^Pr[r>{l + n-'^/^)dn/2]. 

i=0 

It is direct that 

E[Pi\r < (1 + n-^/^)dn/2] < Pr'^lr < (1 + n-^^^)dn/2] ■ ^[A] < ^ (40 + 8e)fe ^_(i+_L_) 
in the final inequality we used Theorem|7J Combining all the above with Lemma[5l we get that 

E[\\fi- ftW] <C -n"^^^. 
for fixed C > 0. The theorem follows by applying the Markov inequality. 

Proof of Theorem 12 As we show in the proof of Lemma[5l with probability at least 1 — exp(— n^'^), 
the number of edges of Gn^d/n is at most (1 + n"^/^)^. From now on in the proof, assume that we are 
dealing with a graph with 0(n) edges. In this case it is direct that r = Q{n), as well. 

Since the number of edges is linear, we need 0{n) time to find whether some edge belongs to a small 
cycle, i.e. a cycle of length less than g^^, or not. This can be done by exploring the structure of the 

g^^ -neighbourhood around this edge. Thus, the algorithm requires 0(?i^) time to create the sequence 
of subgraph. 

Also, it is clear that we need 0{n) time to implement one switching of a colouring. For more details 
on how this can be done see in the proof of Lemma|3l Since r = 0{n), we need 0{v?) time for the all 
colour switchings in the algorithm. 

As far- as the random colouring of Go is regarded we note the following: Using Dynamic Program- 
ming we can compute exactly the number of list colourings of a tree T. In the list colouring problem 
every vertex v £ T has a set List{v) of valid colours, where List{v) C [k] and v only receives a colour 
in List{v). For a tree on / vertices, using dynamic programming we can compute the exact number of 
list colourings in time Ik. For a unicyclic component, i.e. a tree with an extra edge, we can consider 
all the k"^ colourings of the endpoints of the extra edges and for each of these colourings recurse on the 
remaining tree. Thus, it is direct to show that we can have a random k- colouring of Go in time 0{n). 

The theorem follows by noting that the construction of the sequence of subgraphs with the desired 
properties fails with probability at most n~^/^. 



B Proof of Theorem |7] 

So as to prove Theorem |7] we use the following proposition. 
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Proposition 2 Take k > (2 + e)d, where e > and d is a sufficiently large number. Let a be a k- 
colouring of Gi that is chosen u.a.r among Uc'g[w^j(c, c'). For some q € [fc]\{c} we let the event 
Ai = "vi and Ui G Q^^ , ^ ". It holds that 

10 + 2e _fi+ 



Pr[Ai]< ^ V ^45iogd; i = 0, ...,r. 

The reader should remark that since the graph Gi is random, for the probability term Pr[Aj] in the 
proposition it holds that 

Pr[Ai] = E[Pr[Ai\Gi]], 

where the expectation w.r.t. Gi. The proof of Proposition |2] appears in Section IbTT] 

Proof of Theorem |7t Consider, first, a fixed sequence Gi, for i = 0, 1 . . . , r. Assume that we choose 
a A;-colouring u.a.r. among ilj(c, c) and let Qc^q be the disagreement graph specified by the chosen 
/c-colouring and q. Let the event Bi = 'vi,Ui G Qc,q" in the above experiment. 

Similarly, assume that we choose u.a.r. a A;-colouring from Qi{q, c) and let Qq^c be the disagreement 
graph specified by the chosen A;-colouring and q. Let the event Cj ="vi,Ui £ Qc,q" in this experiment. 

We let /3i = m.ayi{Pr[Bi\Q,i{c, c)], Pr[Ci\^i{q, c)]}. Corollary |3] implies that for any a > /3j it 
holds that the set Qi{c, c) is a-isomorphic to 0,i{q, c) with a-function H{-, q). Also, it is straightforward 
that 

E[(3,]<Pr[Bi] + Pr[C,]. 

The above expectation is taken w.r.t. to the instances Gi. 

Assume that we choose u.a.r. a member of a fixed instance of Uc'grwf^j(c, c') and we denote with 
Ei the event that the chosen colouring belongs to r2j(c, c). Also let 

p = Y,PrmG]-V,[G], 
G 

where, for a fixed graph G, Pr[Ei\G] is equal to the probability to have the event Ei when the sets of 
fc-colourings are specified by the graph G. Vi{G) is equal to the probability that an instance of Gi is the 
graph G. Applying the law of total probability we get that 

Pr[Ai] = Pr[Ai\Ei]Pr[E,]+Pr[Ai\Et]Pr[Et] 
> Pr[Ai\Ei]Pr[Ei] = Pr[B,] ■ p. 

Thus, it holds that 

Pr[Bi] <p~^Pr[Ai]. 

Since we have the value of Pr[Aj] from Proposition |2j we only need to compute a lower bound for the 
probability p. For a fixed graph G, let fic denote the Gibbs distribution of the fc-colourings of G. Also 
let fii be defined as follows: 

fi,{a) = Y,l^G{'j)mG) \fae[k]^. 
a 

We use the following claim to compute bounds for p. 

Claim 1 Taking fc > (2 + e)d, where e > is fixed and d is a sufficiently large number, it holds that 

max||/ij(-|cr„) - Hi{-)\\u < ra"-^ i = 0,. . . ,r. 
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It is easy to show that under //j the marginal distribution of the colour assignment of the vertex u is the 
uniform over the set [k]. The above claim suggests that for A; > (2 + e)d it holds that 



1 
P-k 



<n-\ 



Thus we get that 



Prm < 2kPr\A,] < i^^l±i!l^n~('+^ 



Using the same arguments we, also, get that 

The theorem follows. 

Proof of Claim [ij First assume that we have a fixed Gi, i.e. the set of colourings is fixed. Let Xi be 
distributed uniformly over [J^/^!^Qi{c, c') and let Zj be distributed uniformly over Qi. We couple these 
two variables . The coupling is done as follows. Choose u.a.r. a colour from [k], and set Zi{vi) equal to 
this colour, e.g. let Zi{vi) = q. We have two cases. 

li q = c, then we can have an identical coupling between Zi and Xj. Otherwise, i.e. if Zi{v) ^ 
Xi{vi), we can set Zi = H{Xi, q). 

Claim 2 In the later case, i.e. when Zi = H(Xi, q), Zi is distributed uniformly over the colouring of 
Gi that assign the vertex Vi the colour q. 

The proof of the claim appear after the end of this proof. 

Thus, in the case where Zi{v) ^ Xi{v) and we set Zi = H{Xi, q) it is direct to see that Zi{ui) ^ 
Xi{ui) if and only if the event Ai (as defined in the statement of Proposition |2l) holds. Thus we get that 

Pr[X,{u,) ^ Z,{ui)\G.-^ < Pr[Ai\G.-^ 
From the above relation and Proposition |2] we get that 

Pr[Xi{ui) / Z,{ui)\ < Pr[A.-^ < n-\ 
The claim follows by using the Coupling Lemma. 

Proof of Claim |2l We remind the reader that Xi is distributed uniformly at random among the k- 
colourings of Gj that assign the vertex Vi the colour c. It suffice to show that the sets Qc = ^c'e[k]^i{c, c') 
and Q.q = Uc'grw^jlg, c') are isomorphic with bijection H{-, q) : Qc ^ ^q- The arguments we need to 
show this are the same as those we use in the proof of Lemma|2] 

I.e. first we need to show that for any a G Oc it holds that H{a, q) is a proper colouring of Gi. 
Clearly this holds (see the first two paragraphs of the proof of Lemma|2]in section IC21 ) Second we need 
to show that the mapping H{-, q) : ^c ^ ^q is surjective, i.e. for any a G fig there is a a' € Oc such 
that a = H{a', q). It is direct to see that such a' exists, moreover, it holds that a' = H{a, c). Finally, we 
needto show that H{-,q) is one-to-one, i.e. there are no two cJi , (72 G J7c such that i?((Ti, g) = H{a2,q). 
Using arguments similar to for the surjective case it is direct to see that there cannot be such a pair of 
colourings. The claim follows. 
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B.l Proof of Proposition |2] 

Consider the probability distribution C{Gi, k) (or Cd^k) induced by the following experiment. We have 
a graph Gi and we choose a /c-colouring a u.a.r. from Uc'g[fc]rij(c, c'), where c € [k]. Choose u.a.r. a 
colour from [A;]\{c}, let q be that colour. Create the graph of disagreement Qc,q- If w € Qc,q, then w is 
"disagreeing" otherwise it is "non-disagreeing". By definition vi is always in the disagreement graph. 

For a vertex w, we denote with degi{w) its degree in the graph Gi. Consider, also, the product 
measure V{Gi, k) such that each vertex w G Gj is disagreeing with probability q^ = ^_^l ,,^\ and 
non-disagreeing with probability 1 — g^. Also, the vertex vi is disagreeing with probability 1. When 
k < degi{w) we set q^ = 1. 

A path of disagreement in Gi is any simple path which has all its vertices disagreeing. The measure 
V{Gi, k) will turn out to be very useful because it dominates C{Gi, k) in the following sense. 

Lemma 6 Let M = xi,X2, ■ ■ ■ ,xi be a path in Gi such that Vi = xi. Let the event E ="M is a path 
of disagreement" . It holds that 

Proof: Let the event Ei ="xi is disagreeing", for i <l, obviously E = 1^7=1 Ej. It is direct that 

I 

CG,,k[E] = C[Ei\J{CG,,k[E,\ n^,ll Es]. 
i=2 

The path of disagreement is specified by a random colouring from Uc/g[fc]r2j(c, c'), call this random 
colouring X. Let also Nj be the vertices which are adjacent to the vertex Xj. W.l.o.g. assume that 
k > degi{xj), for j < I. Clearly it holds that 

CG„k[E,\ nil} Es] < max Cg^,,[E,\X{N,) = a^J < } . 

'^^Uc' e [k] ^i (c>c ) K — aegi {Xj ) 

Thus 

The lemma follows. 

The following corollary is straightforward. 

Corollary 4 Let M = xi,X2, ■ ■ ■ ,xi be a path in Gi. Let the event E = "M is a path of disagreement" . 
It holds that 

VgAE]<Vg.+,AE]- 
We, also, need the following lemma for the proof of Proposition |2] 

Lemma 7 Consider the product measure V{Gn^d/n-i ^)' fa'' k > (2 + e)d, for fixed e > 0. Let vr be 
a permutation of I + 1 vertices of Gn^d/n> for < / < ©(log^n). There exists do(e), such that for 
d > dQ{e) it holds that 

Vg„^,,^A-^ is a path of disagreement] <l^^ . U__i__ + 3^-0.95 j +2n-"^^'A. 
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Proof: Call vr the path that corresponds to the permutation vr, e.g. tt = {xi, . . . x^+i). Let T be an 
indicator variable such that F = 1 if vr is a path of disagreement and T = 0, otherwise. Let, also, /^ be 
the event that there exists the path (xi, . . . , xi^i) in Gn^d/n- It holds that 



d^' 



Ev[T\ = (-) -EviT^]. 

Let Q^ denote the event that the vertices in vr have degree less than log^ n. Using Chemoff bounds it is 
easy to show that Pr[(5^|I^] > 1 - n"^°s''{n). Also, it holds that 

Ev[T\h] = E'p[T\h,Q^]Pr[Q^%] + Er[T\h,Q^]Pr[Q^%] 

It suffice to show that for < Z < 0(log^ n) and sufficiently large n it holds that 

Ev[T\h,QA < ((Y^^ +3n-°-9') . (8) 

We show ^ by induction on /. Clearly for Z = the inequality in ^ is true. Assuming that ([8]l holds 
for / = Zq, we will show that it holds ioi I = Iq + I, as well. 

For a vertex w, we let D{w) denote the event that this vertex is disagreeing. Given that all vertices 
in {xi , . . . , x/g } are disagreeing we let degout {^i) be the number of vertices in V\{xi , . . . , x/„ } that are 
adjacent to Xi, for 1 < i < Iq. If degout{xi) = t, then all the possible subsets of V\{xi, . . . ,xig} with 
cardinality t are equiprobably adjacent to Xj. This implies that 

Pr[xiQ^i is adjacent to Xj] = — — — for < i < /q — 1- 

n — Iq 

Let degin{xl^J+l) be the number of neighbours of x^y+i in {xi, . . . ,x/q_i}. By the linearity of expecta- 
tion we have 

E[deginixi,+i)\h,Q^] < -^E[degoutixi)\U,Q^] < n-^-^\ (9) 

n — Lq 

We make the simplifying assumption that if the vertex x;„+i is adjacent to any vertex in {xi, . . . , Xig_i}, 
then it is disagreeing, regardless of the number of adjacent vertices outside the path. By Q and the 
Markov inequality, we get that 



Pr[degi„,(xzo+i) > Q\h,Q-„] < E[degin{xig+i)\lTr,QTr] < n 



-0.95 



We denote with E the event that "(^^i, . . . , x/g) is a path of disagreement, degin{xi^^j^i) = 0, the edge 
{x^Q, XiQ_|_i} appears in Gn^d/n ^"'^ ^'^^ event Q^^ holds". It is easy to show that Pr[E] > 1 — 2n~^-^^. 
It holds that 



Pr[D{xig+i)\E] < "^ Pr[D{xi,,+i)\E, degout{xio+i) = j]Pr[degout{xio+i) = j\E] 

3=0 

fc-i 

-0.95^ 



j=o •? VJ/ 



+ (1 + 3n-0-95) Y^ ('"'^ {d/ny{l - d/n)''-^ 

j=k 



< q{k,d) + 3n-°-^^ 
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where 

The following inequalities ai^e straightforward. 

fc/2 . s " / \ 

j=0 •? VJ/ j=fc/2+l ^-^^ 

2 

< -+Pr[5(n,(i/?i) > (l + e/2)(i+ll. 

Using Chemoff bounds, i.e. Corollary 2.4 from 151 we get that 

Pr[B{n, d/n) > (1 + e/2)d + 1] < exp(-c'(i) 

where c' = log e — 1 + -^^ > 0. It is clear that taking A; > (2 + e)d for fixed e > 0, there is sufficiently 
large do(e) such that for d > do{e) it holds that 

,, „ 2 + 2/d 1 

q{k,d) < -^ < 



(1 + €/A)d 
The lemma follows. 

Proof of Proposition |2j Let the event B ="vi and Uj ai^e connected through a path of disagreement of 
length at most log^ n". Also, let the event C ="uj and Uj are connected through a path of length greater 
than log^ n". Clearly it holds that 

CG„k[A]<J^G,,k[B] + CG,AC], 

where Lci.fc is the probability distribution we defined at the begining of this section. When there is no 
danger of confusion we drop the subscript Gi , k The proposition will follow by calculating the proba- 
bilities £[S] and£[C]. 

Consider an enumeration of all the permutations of / vertices in Gi with first the vertex Vi and last 
the vertex Ui. Let tto{1),tti{1), ... be the permutations in the order they appear in the enumeration. Let 
Tj{l) be the random variable such that 

,, ( 1 the path that corresponds to 7rj(/) is a path of disagreement 

■^ \ otherwise. 

Let, also, r(/) = ^ rj(0- It is easy to see that the number of sumads in the previous sum ai^e at most 
n'~^. Towards computing C{C), we need to calculate the following expectation 



Ec 



log n 



Er(0 

l=lo 



where Iq = q°^"^ . However, we have to take into consideration that we have conditioned that Vi and Ui 
are at distance at least g^^j. To this end, it is direct to show that if Z the number of paths of length at 



most g^Tj — 1 between two vertices of Gi, then 



E[Z] < Y. -"^ (^) ^ - 



I 

9/10 



' -Ologd 
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Thus, letting p be the probabihty of the event that two vertices are at distance is at least qj° ^ the Markov 
inequality suggests that p > 1 — n^^/^'^. Using Lemma[6l Lemma|7]and Corollary |4] we get that 



Ec 



log n 
l=lo 



- '''t/^i^' {{(TT^^ 



+ 3n 



-0.95 



2n 



- log n 



< 



-0.95^' , nJ^~log*n 



^ l=lo 
Note that Sn"^°^ " = 0{n~^°^ "), for / = 0(log^ n). Thus, for sufficiendy large n and d we get that 



log n 

Er(0 



- -^ 2n V 5 



1 



3 / e\-'o 
- 2^V "^5/ l-(l + e/5)-i 



15 + 3e -fi+W^ 

< ■ n V^«lo: 

- 2e 



igd 



Using the Markov inequahty we get that 

C[B] < Ec 



log^ n 

Er(0 

l>lo 



15 + 3e _(i+_ 

< n V 4. 

- 2e 



logd , 



(10) 



Let P(/) be the number of paths of disagreement between Vi and any vertex of Gt, that have length 
/. It is direct that 

C[C] <Pr[P{log^n) >0] . 

The above inequality follows by noting that so as to have a path of disagreement connecting Vi and Ui 
which has length at least I, we should have some path of disagreement of length / leaving Vi. Using 
Markov's inequality we get that 



Pr [P(log2 n) > O] < Ec [P(log^ n)] 



< p n 



l^log n— 1 



n 



log n 



1 



;i + 6/4)d 



+ 3n 



log-^ n ^ 

-0.95 \ , o^-log^n 



+ 271" 



pn \ / 



The proposition follows. 







C Proofs 

C.l Lemma [1] 

Proof: Let x be a r.v. distributed as in u. The proof of this lemma is going to be made by coupling x 
and z'. In particular, we show that there is a coupling of x and z' such that 

Pr[x 7^ z'] < a. 
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Then the lemma will follow by using Coupling Lemma lUl. Let {Q[, ^2) be the isomorphic pair of the 
a-isomorphism between J7i and ^2- Observe that \^'^\ > (1 — Q)|rij|, for i = 1,2. Also, it holds that 



Pr[z' = a\z G n[] = 77^7 Vct e n'^. 

Note that when we restrict the input of the a-function H only to members of Q,[, then H is by definition 
a bijection between the sets Q,[ and 0,2- The above equality then follows by using Corollary [T] and noting 
that conditional on the fact that z G Q,[, z is distributed uniformly over Q[. Also it is easy to get that 

Pr[x = a\x G Q'^] = -^ Va G flL 

mi 

Let 

p = min{Pr[x G fi2],-Pr[z G n'^]} > 1 - a. (11) 

The above inequality follows from the assumption that Qi and ^2 are a-isomorphic. 

It is clear that we can have a coupling between x, z and z' such that the event E ="z G il'i and 
X G ^2" holds with probability p (see (fTTl) ). In this coupling, if the event E holds, then x and y are 
distributed uniformly over ^^2 and W^, respectively. This means that we can make an extra arrangement 
such that when E holds to have x = H{z), as well. Since z' = H{z), it is direct that when the event 
E holds we, also, have that x = z' . We conclude that it the above coupling it holds that Pr[x = z'\ > 
Pr[E]. Thus, 

Pr[x + z\<\- Pr\E\ = 1 - p = a. 

The lemma follows. <0 



C.2 Lemma H 

Proof: First we are going to show that for any a G S{c, c), it holds that H{a, q) is a proper colouring of 
G. Assume the contrary, i.e. that there is cr G S{c, c) such that H{a, q) is a non-proper colouring, i.e. 
there is a monochromatic edge e. Let Qa^^q be the disagreement graph specified by a and q. It is direct 
that the monochromatic edge is either incident to two vertices in Q(Tv,q or to some vertex in Qav,q aid 
some vertex outside the disagreement graph. 

It is direct that H{a, q) does not cause any monochromatic edge between two vertices in Qa^.q- To 
see this, note that the disagreement graph is bipartite and a specifies exactly one colour for each part of 
the graph, while H{(t, q) switches the colours of the two parts. On the other hand, H{a, q) cannot cause 
any monochromatic edge between a vertex in Qa^^q and some vertex outside the disagreement graph. 
This follows by the fact that the disagreement graph is maximal. Thus, there is no edge w outside Qa^.q 
such that Gw G {q, a^} while at the same time w is adjacent to some vertex in Qa^^q- 

Also, it is direct to show that for any a G S{c, c), it holds that H{cj, q) G S{q, c). This follows by 
the definition of the sets S{c, c) and S{c, q). It remains to show that H{-, q) : S{c, c) — )■ S{q, c) is a 
bijection. 

We show that H{-,q) has range the set S{q,c), i.e. it is surjective map, ie. for any a G S{q,c) 
there is a' G S{c, c) such that a = H{a', q). It is direct to see that such a' exists, moreover, it holds 
a' = H{a,c). 

Finally, we need to show that H{-, q) is one-to-one, i.e. there are no two ai, 0-2 G 5(c, c) such that 
H{ai, q) = H{a2,q). Using arguments similar to those in the previous paragraph it is direct to see that 
there cannot be such a pair of colourings. 

Thus, since H{-,q) : S{c,c) — )• S{q,c) is surjective and one-to-one it is a bijection. The lemma 
follows. 



A.9 



C.3 Theorem |3] 

Proof: Let X be the input of STEP, i.e. a random fc-colouring of G. Let Y be equal to the colouring 
that is returned by the algorithm. Also, let Z be a random variable distributed as in v. The proof of the 
theorem is going to be made by coupling Z and Y and by showing that in this coupling it holds that 

Pr[Z / y] < a. 

The reader should observe that for any q,c £ [k] such that c / g, it holds that 

Pr[Z{v) = q\Z{u) = c] = Pr[X{v) = q\X{u) = c, X{v) ^ c] = -^ (12) 

and 

Pr\X{v) = X{u) = c\Xi?,h&A] = \, (13) 

k 

due to symmetry. Also, it is direct to show that 

Pr\Yiv) = q\X{u) =c] = -^ (14) 

k — \ 

for every q E [fe]\{c}. Now we are going to construct the coupling. We need to involve the variable 
X, the input of STEP, in this coupling. First, set Z{u) = X{u) and then set Z{v) = Y{v). Using the 
above observations it is straightforward to show that Z{u) and Z{v) are set, respectively, according to 
the appropriate distribution (due to (IT2l) and (fT4l) ). 

We reveal the values of X{v), X{u) and Y{v). By the above coupling we also have the values of 
Z{v) and Z{u). We consider two cases, depending on whether X is a good or a bad colouring. 

If X{v) 7^ X{u), i.e. X is good, then we have X = Y and we can set directly X = Z. Thus, for 
the coupling it holds 

Pr[Y ^ Z\X is good] = 0. 

If X{u) = X{v), then w.l.o.g. we can assume X{u) = X{v) = c, for some c € [k]. In this case, we 
choose whether X G S{c, c) or not. For this choice, the Assumption [T] suggests that 

Pr[X e S{c,c)\X{u) = X{v) = c] > 1 - a. 

Similarly for Z, assume that Z{u) = c and Z{v) = q, with c ^ q. Again Assumption [T] suggests that 

Pr[Z G S{q, c) \X{u) = X{v) = c] > I - a. 

Let the event E ="X € S{c, c) and Z G S{q, c)". Having set X{v),X{u), Z{v), Z{u), Y{v), the two 
previous inequalities suggest that we can couple X and Z such that the probability of the event E to 
occur is at least 1 — a. 

Claim 3 Conditional on the event E, Y is distributed uniformly over S{q, c). 

Conditional on the event E, it is easy to observe that Z is, also, distributed uniformly over S{q, c). This 
observation and Claim [3] suggest that 

Pr[Z y^ Y\E] = 0. 

Gathering all the above together and applying the law of total probability we get the following for the 
coupling: 

Pr[Z i-Y\< Pr[Z / Y\X is good] + Pr[Z / Y\E] + Pr[E] < a. 
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The theorem follows. 

Proof of Claim |3t Conditional on the event E, the random variable X is distributed uniformly over 
S{c, c). Note that S{c, c) and S{q, c) are isomorphic, due to Lemma|2l The same lemma suggests that 
we can have a bijection between the two isomorphic sets by taking H{-, q) and restricting its input only 
to colourings in S{c, c). Thus, since X is distributed uniformly over S{c, c), H{X, q) = Y is distributed 
uniformly over S{q, c), by Corollary[T] he claim follows. 



C.4 Lemma |3] 

Proof: Note that the time complexity of computing the value of H{a, q) is dominated by the time we 
need to reveal the disagreement graph Qa^^q, for some q € [k]. We show that we need 0(|-E|) steps to 
reveal the disagreement graph Qa^.q- 

We can reveal the graph Qa^,q in steps j = 0, . . . , |i?|, where E is the set of edges of G. At step 
the disagreement graph Qcti,,<j(0) contains only the vertex v. Given the graph Qa^^qij) we construct 
Qav,q{j + 1) as follows: Pick some edge which is incident to a vertex in Qay,q{j)- If the other end of 
this edge is incident to a vertex outside Qa^^qij) that is coloured either ay or q then we get Qa^,q{3 + 1) 
by inserting this edge and the vertex into Qa^^qij). Otherwise Qa^,q{j + 1) is the same as Qay,q{j)- We 
never pick the same edge twice. 

It is direct to show that in the above procedure it holds that Qa^.q = QcT^.q{\E\). Thus, the time 
complexity of a g-switching of a given colouring oiG is 0{\E\). 



C.5 Theorems] 

Proof: Let Xi be a random variable which is distributed uniformly over Oj, i = 0, . . . , r. It suffices to 
provide a coupling of Xj- and Yy, such that 

Pr[Xr^Yr] <r-a. 

Working as in the proof of Theorem[5]we get the following: There is a coupling of Xi,Xi^i such that 
for the event Ei = " Xi is good or there are c,q£ [k] such that Xi G Si{c, c) and Xj+i G S{q, c) it 
holds that 

Pr[Ei] >l-a. 

Now consider the random variables Z = (Xq, Xi, . . . , Xr-i) and Z' = (Xi, X2, . . . Xr) and W = 
(Yi, . . . , Yr). Consider, also, the event E = n[~Q£'j, where Ei is the event defined above. All the above 
discussion suggests two facts: First, there is a coupling between Z, Z' and W such that 

Pr{E\ > 1 - ra. 

Second, if in this coupling the event E occurs we can have Z' = W , i.e. Xi = Fj, for i = 1, . . . , r. 
To see thus, consider the following: If the event Ei occurs we can have either Xi = Xj+i or Xj+i = 
H{Xi, q) for appropriate q. When E occurs, we have this property for alH = 0, . . . , r — 1. A direct 
inductive argument implies Z' = W. The theorem follows by noting that 

Pr[Yr. ^ Xr] < l-Pr[E]. 
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C.6 Lemma m 



Proof: For (1) it suffice to show that with probability at least 1 — n~^' ^ all the cycles of length less than 
g^^ in Gn^d/n do not share edges with each other. Let 7 = (91og(d))~^. Assume the opposite, there 
are at least two cycles, each of length at least 7 log n that intersect with each other. Then, there must 
exist a subgraph of Gn^d/n that contains at most 27 log n vertices while the number of edges exceeds by 
1, or more, the number of vertices. 

Let D be the event that in Gn^d/n there exists a set of r vertices which have r + 1 edges between 
them. For r < 27 log n we have the following: 



7logn 

Pr[D] < Yl 

r=l 
7logn 



^2^ ]{d/nY+\l-d/nf2)-(^+^) 



ne\^ 



r=l 



/ne 
r 






2(r + 



< — 

n 



C /e^rfV"''"'" 



2- 



Having 27 • \og{e'^d/2) < 1, the quantity in the r.h.s. of the last inequality is o(l), in particular it is of 
order 0(n'^'°s('^ '^/2)-i) Thus, for 7 = (91ogd)~^ there is no connected component that contains two 
cycles with probability at least 1 — 2n~2/^. 

If we include in Go all the edges that belong to small cycles, i.e. of length less than q°^"^ then it is 
straightforward that (2) holds. 

For (3), we let E{Gn^d/n) be the number of edges in Gn^d/n- Using standard probabilistic tools, i.e. 
Chernoff bounds, it is direct to get that 



Pr 



EiG^,d/n) > (1 + n 



-1/3 



dn 



< exp 



-n 



1/4 



It is direct that r, the number of terms in the sequence of subgraphs of Gn^d/n^ is upper bounded by 
E{Gn^d/n)- Thus, the above inequality implies that 



Pr 



r > (1 + n 



-1/3 



dn 



< exp 



-n 



1/4 



The lemma follows. 







C.7 Proof of Corollary [I 

The existence of the bijection T impUes that |J7ij = \0,2\- Thus V^ £ f^i it holds that 

Pr[X = ^]= Pt[T{X) = nO] ^ 



\n^ 



Since, for every o" G O2 there is a unique a' € Oi such that T(cr') = cr we get that 

1 1 



Pr[T{X) = a] 



\Qi\ IO2I 



The corollary follows. 
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